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ABSTRACT: Results of a search for H —>• tt decays are presented, based on the full set of 
proton-proton collision data recorded by the ATLAS experiment at the LHC during 2011 
and 2012. The data correspond to integrated luminosities of 4.5 fb” 1 and 20.3 ftA 1 at 
centre-of-mass energies of \fs = 7 TeV and = 8 TeV respectively. All combinations of 
leptonic (r — > lvi> with t = e, fj) and hadronic (r —> hadrons v) tau decays are considered. 
An excess of events over the expected background from other Standard Model processes 
is found with an observed (expected) significance of 4.5 (3.4) standard deviations. This 
excess provides evidence for the direct coupling of the recently discovered Higgs boson to 
fermions. The measured signal strength, normalised to the Standard Model expectation, of 
H = 1.43 is consistent with the predicted Yukawa coupling strength in the Standard 
Model. 
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1 Introduction 


The investigation of the origin of electroweak symmetry breaking and, related to this, the 
experimental confirmation of the Brout-Englert-Higgs mechanism [1-6] is one of the prime 
goals of the physics programme at the Large Hadron Collider (LHC) [7]. With the discovery 
of a Higgs boson with a mass of approximately 125 GeV by the ATLAS [ 8 ] and CMS [9] 
collaborations, an important milestone has been reached. More precise measurements of 
the properties of the discovered particle [ 10 , 11 ] as well as tests of the spin-parity quantum 
numbers [12-14] continue to be consistent with the predictions for the Standard Model 
(SM) Higgs boson. 

These measurements rely predominantly on studies of the bosonic decay modes, H —>• 77 , 
H —>• ZZ* and H —> WW* . To establish the mass generation mechanism for fermions as 
implemented in the SM, it is of prime importance to demonstrate the direct coupling of 
the Higgs boson to fermions and the proportionality of its strength to mass [15]. The most 
promising candidate decay modes are the decays into tau leptons, H —> rr, and bottom 
quarks (h-quarks), H —> bb. Due to the high background, the search for decays to bb is 
restricted to Higgs bosons produced in modes which have a more distinct signature but a 
lower cross-section, such as H production with an associated vector boson. The smaller 
rate of these processes in the presence of still large background makes their detection chal¬ 
lenging. More favourable signal-to-background conditions are expected for H —> rr decays. 
Recently, the CMS Collaboration published evidence for H —> rr decays at a significance in 
terms of standard deviations of 3.2er [16], and an excess corresponding to a significance of 
2.1cr in the search for H —» bb decays [17]. The combination of channels provides evidence 
for fermionic couplings with a significance of 3.8cr [18]. The yield of events in the search for 
H —» bb decays observed by the ATLAS Collaboration has a signal significance of 1.4cr [19]. 
The Tevatron experiments have observed an excess corresponding to 2.8a in the H —>• bb 
search [ 20 ]. 

In this paper, the results of a search for H —> rr decays are presented, based on the 
full proton-proton dataset collected by the ATLAS experiment during the 2011 and 2012 
data-taking periods, corresponding to integrated luminosities of 4.5 fb _1 at a centre-of- 
mass energy of \fs = 7 TeV and 20.3 fb -1 at yfs = 8 TeV. These results supersede the 
earlier upper limits on the cross section times the branching ratio obtained with the 7 
TeV data [21]. All combinations of leptonic (r —> iuu with i = e, fi) and hadronic (r —>• 
hadrons u) tau decays are considered . 1 The corresponding three analysis channels are 
denoted by Ti cp Ti cp , Ti ep Th a d, and RiadThad in the following. The search is designed to be 
sensitive to the major production processes of a SM Higgs boson, i.e. production via gluon 
fusion (ggF) [22], vector-boson fusion (VBF) [23], and associated production ( VH ) with 
V = W or Z. These production processes lead to different final-state signatures, which are 
exploited by defining an event categorisation. Two dedicated categories are considered to 
achieve both a good signal-to-background ratio and good resolution for the reconstructed 
rr invariant mass. The VBF category, enriched in events produced via vector-boson fusion, 


throughout this paper the inclusion of charge-conjugate decay modes is implied. 



is defined by the presence of two jets with a large separation in pseudorapidity. 2 The 
boosted category contains events where the reconstructed Higgs boson candidate has a large 
transverse momentum. It is dominated by events produced via gluon fusion with additional 
jets from gluon radiation. In view of the signal-to-background conditions, and in order 
to exploit correlations between final-state observables, a multivariate analysis technique, 
based on boosted decision trees (BDTs) [24-26], is used to extract the final results. As 
a cross-check, a separate analysis where cuts on kinematic variables are applied is carried 
out. 

2 The ATLAS detector and object reconstruction 

The ATLAS detector [27] is a multi-purpose detector with a cylindrical geometry. It com¬ 
prises an inner detector (ID) surrounded by a thin superconducting solenoid, a calorimeter 
system and an extensive muon spectrometer in a toroidal magnetic held. The ID track¬ 
ing system consists of a silicon pixel detector, a silicon microstrip detector (SCT), and a 
transition radiation tracker (TRT). It provides precise position and momentum measure¬ 
ments for charged particles and allows efficient identification of jets containing 6-hadrons 
( 6 -jets) in the pseudorapidity range 1 77 1 < 2.5. The ID is immersed in a 2 T axial magnetic 
held and is surrounded by high-granularity lead/liquid-argon (LAr) sampling electromag¬ 
netic calorimeters which cover the pseudorapidity range |? 7 | < 3.2. A steel/scintillator tile 
calorimeter provides hadronic energy measurements in the central pseudorapidity range 
(|77I < 1.7). In the forward regions (1.5 < \r]\ < 4.9), the system is complemented by two 
end-cap calorimeters using LAr as active material and copper or tungsten as absorbers. The 
muon spectrometer (MS) surrounds the calorimeters and consists of three large supercon¬ 
ducting eight-coil toroids, a system of tracking chambers, and detectors for triggering. The 
deflection of muons is measured within 1 77 I < 2.7 by three layers of precision drift tubes, and 
cathode strip chambers in the innermost layer for |ry| > 2.0. The trigger chambers consist 
of resistive plate chambers in the barrel (|r/| < 1.05) and thin-gap chambers in the end-cap 
regions (1.05 < 1 77 ! < 2.4). 

A three-level trigger system [28] is used to select events. A hardware-based Level-1 
trigger uses a subset of detector information to reduce the event rate to a value to 75 kHz 
or less. The rate of accepted events is then reduced to about 400 Hz by two software-based 
trigger levels, Level-2 and the Event Filter. 

The reconstruction of the basic physics objects used in this analysis is described in 
the following. The primary vertex referenced below is chosen as the proton-proton vertex 
candidate with the highest sum of the squared transverse momenta of all associated tracks. 

Electron candidates are reconstructed from energy clusters in the electromagnetic 
calorimeters matched to a track in the ID. They are required to have a transverse en- 

2 The ATLAS experiment uses a right-handed coordinate system with its origin at the nominal interaction 
point (IP) in the centre of the detector and the z-axis along the beam direction. The x-axis points from 
the IP to the centre of the LHC ring, and the y- axis points upward. Cylindrical coordinates (r, (j>) are used 
in the transverse (x, y) plane, cp being the azimuthal angle around the beam direction. The pseudorapidity 
is defined in terms of the polar angle 8 as y = — lntan(0/2). The distance A R in the y-cp space is defined 
as A R = ■sj (A r/) 2 + (A <p) 2 . 
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ergy, Ex = Esin 9, greater than 15 GeV, to be within the pseudorapidity range 1 77 I < 2.47, 
and to satisfy the medium shower shape and track selection criteria defined in ref. [29]. 
Candidates found in the transition region between the end-cap and barrel calorimeters 
(1.37 < |p| < 1.52) are not considered. Typical reconstruction and identification efficiencies 
for electrons satisfying these selection criteria range between 80% and 90% depending on 
E'y and 77 . 

Muon candidates are reconstructed using an algorithm [30] that combines information 
from the ID and the MS. The distance between the z-position of the point of closest approach 
of the muon inner-detector track to the beam-line and the z-coordinate of the primary vertex 
is required to be less than 1 cm. This requirement reduces the contamination due to cosmic- 
ray muons and beam-induced backgrounds. Muon quality criteria such as inner detector 
hit requirements are applied to achieve a precise measurement of the muon momentum 
and reduce the misidentihcation rate. Muons are required to have a momentum in the 
transverse plane px > 10 GeV and to be within |?y| < 2.5. Typical efficiencies for muons 
satisfying these selection criteria are above 95% [30]. 

Jets are reconstructed using the anti-A;* jet clustering algorithm [31, 32] with a radius 
parameter R = 0.4, taking topological energy clusters [33] in the calorimeters as inputs. 
Jet energies are corrected for the contribution of multiple interactions using a technique 
based on jet area [34] and are calibrated using px- and ^-dependent correction factors 
determined from data and simulation [35-37]. Jets are required to be reconstructed in the 
range |p| < 4.5 and to have px > 30 GeV. To reduce the contamination of jets by additional 
interactions in the same or neighbouring bunch crossings (pile-up), tracks originating from 
the primary vertex must contribute a large fraction of the px when summing the scalar px 
of all tracks in the jet. This jet vertex fraction (JVF) is required to be at least 75% (50%) 
for jets with |p| < 2.4 in the 7 TeV (8 TeV) dataset. Moreover, for the 8 TeV dataset, the 
JVF selection is applied only to jets with px < 50 GeV. Jets with no associated tracks are 
retained. 

In the pseudorapidity range |?y| < 2.5, 6 -jets are selected using a tagging algorithm [38]. 
The 6 -jet tagging algorithm has an efficiency of 60-70% for 6 -jets in simulated tt events. 
The corresponding light-quark jet misidentihcation probability is 0.1-1%, depending on the 
jet’s px and p. 

Hadronically decaying tau leptons are reconstructed starting from clusters of energy 
in the electromagnetic and hadronic calorimeters. The Th a d 3 reconstruction is seeded by 
the anti-/p jet finding algorithm with a radius parameter R = 0.4. Jet-specific cleaning 
selection such as the JVF requirement is not needed for the tau candidate seeds, due to 
stricter vertex requirements. Tracks with px > 1 GeV within a cone of radius 0.2 around 
the cluster barycentre are matched to the Th a d candidate, and the r^ad charge is determined 
from the sum of the charges of its associated tracks. The rejection of jets is provided in a 
separate identification step using discriminating variables based on tracks with px > 1 GeV 
and the energy deposited in calorimeter cells found in the core region (A R < 0 . 2 ) and in the 
region 0.2 < A R < 0.4 around the Th a d candidate’s direction. Such discriminating variables 

3 In the following, the Thad symbol always refers to the visible decay products of the r hadronic decay. 
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are combined in a boosted decision tree and three working points, labelled tight, medium 
and loose [39], are defined, corresponding to different Thad identification efficiency values. 

In this analysis, Thad candidates with pp > 20 GeV and |ry| < 2.47 are used. The Thad 
candidates are required to have charge ±1, and must be 1- or 3-track (prong) candidates. 
In addition, a sample without the charge and track multiplicity requirements is retained for 
background modelling in the ThadAiad channel, as described in section 6.2. The identifica¬ 
tion efficiency for Thad candidates satisfying the medium criteria is of the order of 55-60%. 
Dedicated criteria [39] to separate Thad candidates from misidentified electrons are also ap¬ 
plied, with a selection efficiency for true Th a d decays of 95%. The probability to misidentify 
a jet with pp > 20 GeV as a Th a d candidate is typically 1 - 2 %. 

Following their reconstruction, candidate leptons, hadronically decaying taus and jets 
may point to the same energy deposits in the calorimeters (within A R < 0.2). Such 
overlaps are resolved by selecting objects in the following order of priority (from highest to 
lowest): muons, electrons, Th a d, and jet candidates. For all channels, the leptons that are 
considered in overlap removal with Th a d candidates need to only satisfy looser criteria than 
those defined above, to reduce misidentified Th a d candidates from leptons. The pp threshold 
of muons considered in overlap removal is also lowered to 4 GeV. 

The missing transverse momentum (with magnitude El j? lss ) is reconstructed using the 
energy deposits in calorimeter cells calibrated according to the reconstructed physics objects 
(e, 7 , Thad; jets and p) with which they are associated [40]. The transverse momenta of 
reconstructed muons are included in the F?™ 1SS calculation, with the energy deposited by 
these muons in the calorimeters taken into account. The energy from calorimeter cells 
not associated with any physics objects is scaled by a soft-term vertex fraction and also 
included in the Elf lss calculation. This fraction is the ratio of the summed scalar pp of 
tracks from the primary vertex not matched with objects to the summed scalar pp of all 
tracks in the event also not matched to objects. This method allows to achieve a more 
accurate reconstruction of the E™ ss in high pile-up conditions [41]. 

3 Data and simulated samples 

After data quality requirements, the integrated luminosities of the samples used are 4.5 fb _1 
at y/s = 7 TeV and 20.3 fb _1 at y/s = 8 TeV. 

Samples of signal and background events were simulated using various Monte Carlo 
(MC) generators, as summarised in table 1. The generators used for the simulation of the 
hard-scattering process and the model used for the simulation of the parton shower, of the 
hadronisation and of the underlying-event activity are listed. In addition, the cross section 
values to which the simulation is normalised and the perturbative order in QCD of the 
respective calculations are provided. 

The signal contributions considered include the three main processes for Higgs boson 
production at the LHC: ggF, VBF, and associated VH production processes. The contribu¬ 
tions from the associated ttH production process are found to be small and are neglected. 
The ggF and VBF production processes are simulated with Powheg [42-45] interfaced to 
PYTHIA8 [46]. In the POWHEG event generator, the CT10 [47] parameterisation of the 
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parton distribution functions (PDFs) is used. The overall normalisation of the ggF process 
is taken from a calculation at next-to-next-to-leading-order (NNLO) [48-53] in QCD, in¬ 
cluding soft-gluon resummation up to next-to-next-to-leading logarithm terms (NNLL) [54], 
Next-to-leading order (NLO) electroweak (EW) corrections are also included [55, 56]. Pro¬ 
duction by VBF is normalised to a cross section calculated with full NLO QCD and EW 
corrections [57-59] with an approximate NNLO QCD correction applied [60]. The associated 
VH production process is simulated with Pythia8. The Cteq6L1 [61] parameterisation 
of PDFs is used for the PYTHIA8 event generator. The predictions for VH production are 
normalised to cross sections calculated at NNLO in QCD [62], with NLO EW radiative 
corrections [63] applied. 

Additional corrections to the shape of the generated px distribution of Higgs bosons 
produced via ggF are applied to match the distribution from a calculation at NNLO includ¬ 
ing the NNLL corrections provided by the HRES2.1 [64] program. In this calculation, the 
effects of finite masses of the top and bottom quarks [64, 65] are included and dynamical 
renormalisation and factorisation scales, pr, pp = \J + p\. are used. A reweighting is 
performed separately for events with no more than one jet at particle level and for events 
with two or more jets. In the latter case, the Higgs boson px spectrum is reweighted to 
match the MinLo HJJ predictions [66]. The reweighting is derived such that the inclusive 
Higgs boson px spectrum and the px spectrum of events with at least two jets match the 
HRES2.1 and MinLo HJJ predictions respectively, and that the jet multiplicities are in 
agreement with (N)NLO calculations from JetVHeto [67-69]. 

The NLO EW corrections for VBF production depend on the px of the Higgs boson, 
varying from a few percent at low px to ~ 20% at px = 300 GeV [70]. The px spectrum 
of the VBF-produced Higgs boson is therefore reweighted, based on the difference between 
the POWHEG+PYTHIA calculation and the HAWK [57, 58] calculation which includes these 
corrections. 

The main and largely irreducible Z/ 7 * — > tt background is modelled using Z/ 7 * — > pp 
events from data, 4 where the muon tracks and associated energy depositions in the calorime¬ 
ters are replaced by the corresponding simulated signatures of the final-state particles of the 
tau decay. In this approach, essential features such as the modelling of the kinematics of 
the produced boson, the modelling of the hadronic activity of the event (jets and underlying 
event) as well as contributions from pile-up are taken from data. Thereby the dependence 
on the simulation is minimised and only the t decays and the detector response to the 
tau-lepton decay products are based on simulation. By requiring two isolated, high-energy 
muons with opposite charge and a dimuon invariant mass m w > 40 GeV, Z —>• pp events 
can be selected from the data with high efficiency and purity. To replace the muons in 
the selected events, all tracks associated with the muons are removed and calorimeter cell 
energies associated with the muons are corrected by subtracting the corresponding energy 
depositions in a single simulated Z —> event with the same kinematics. Finally, both 

the track information and the calorimeter cell energies from a simulated Z —>• tt decay are 

4 These processes are hereafter for simplicity denoted by Z — > tt and Z —> ^Lfj, respectively, even though 
the whole continuum above and below the Z peak is considered. 
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added to the data event. The decays of the tau leptons are simulated by TAUOLA [71]. 
The tau lepton kinematics are matched to the kinematics of the muons they are replacing, 
including polarisation and spin correlations [72], and the mass difference between the muons 
and the tau leptons is accounted for. This hybrid sample is referred to as embedded data 
in the following. 

Other background processes are simulated using different generators, each interfaced 
to PYTHIA [46, 73] or HERWIG [74, 75] to provide the parton shower, hadronisation and the 
modelling of the underlying event, as indicated in table 1. For the HERWIG samples, the 
decays of tau leptons are also simulated using TAUOLA [71]. Photon radiation from charged 
leptons for all samples is provided by PHOTOS [76]. The samples for VP/Z’+jets production 
are generated with ALPGEN [77], employing the MLM matching scheme [78] between the 
hard process (calculated with LO matrix elements for up to five partons) and the parton 
shower. For WW production, the loop-induced gg —> WW process is also generated, using 
the GG2WW [79] program. In the AcerMC [80], ALPGEN, and HERWIG event generators, 
the CTEQ6L1 parameterisation of the PDFs is used, while the CT10 parameterisation is 
used for the generation of events with GG2WW. The normalisation of these background 
contributions is either estimated from control regions using data, as described in section 6, 
or the cross sections quoted in table 1 are used. 

For all samples, a full simulation of the ATLAS detector response [81] using the Geant4 
program [82] was performed. In addition, events from minimum-bias interactions were 
simulated using the AU2 [83] parameter tuning of PYTHIA8. The AU2 tune includes 
the set of optimized parameters for the parton shower, hadronisation, and multiple parton 
interactions. They are overlaid on the simulated signal and background events according to 
the luminosity profile of the recorded data. The contributions from these pile-up interactions 
are simulated both within the same bunch crossing as the hard-scattering process and in 
neighbouring bunch crossings. Finally, the resulting simulated events are processed through 
the same reconstruction programs as the data. 

4 Event selection and categorisation 
4.1 Event selection 

Single lepton, dilepton and di-Thad triggers were used to select the events for the analysis. A 
summary of the triggers used by each channel at the two centre-of-mass energies is reported 
in table 2. Due to the increasing luminosity and the different pile-up conditions, the online 
PT thresholds increased during data-taking in 2011 and again for 2012, and more stringent 
identification requirements were applied for the data-taking in 2012. The pt requirements 
on the objects in the analysis are usually 2 GeV higher than the trigger requirements, to 
ensure that the trigger is fully efficient. 

In addition to applying criteria to ensure that the detector was functioning properly, 
requirements to increase the purity and quality of the data sample are applied by rejecting 
non-collision events such as cosmic rays and beam-halo events. At least one reconstructed 
vertex is required with at least four associated tracks with pt > 400 MeV and a posi- 
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Signal (m H = 125 GeV) 

MC generator 

a x B [pb] 
y/s = 8 TeV 

ggF, H -> TT 

Powheg [42-45] 

1.22 

NNLO+NNLL 

[48-53, 84] 


+ Pythia8 [46] 





VBF, H^tt 

Powheg + Pythia8 

0.100 


(N)NLO 

[57-59, 84] 

WH, H -*tt 

Pythia8 

0.0445 


NNLO 

[62, 84] 

ZH, H -+tt 

Pythia8 

0.0262 


NNLO 

[62, 84] 

Background 

MC generator 

a x B [pb] 

^ = 8 TeV 

W(-> tv ), [l = e,/x,r) 

Alpgen [77]+Pythia8 

36800 


NNLO 

[85, 86] 

Z/'fir+U), 

60 GeV < m u < 2 TeV 

Alpgen+Pythia8 

3910 


NNLO 

[85, 86] 

Z/'fir+U), 

10 GeV < m u < 60 GeV 

Alpgen+Herwig [87] 

13000 


NNLO 

[85, 86] 

VBF Z/-i*{-+ It) 

Sherpa [88] 

1.1 


LO 

[88] 

tt 

Powheg + Pythia8 

2531 

NNLO+NNLL 

[89-94] 

Single top : Wt 

Powheg + Pythia8 

22+ 


NNLO 

[95] 

Single top : s-channel 

Powheg + Pythia8 

5.6+ 


NNLO 

[96] 

Single top : f-channel 

AcerMC [80 ]+Pythia6 [73] 

87.8+ 


NNLO 

[97] 

qq WW 

Alpgen+Herwig 

54t 


NLO 

[98] 

gg -> WW 

gg2WW [79]+Herwig 

1.4+ 


NLO 

[79] 

WZ, ZZ 

Herwig 

30+ 


NLO 

[98] 

H ->WW 

same as for H —> tt signal 

4.7t 




Table 1. Monte Carlo generators used to model the signal and the background processes a 

y/s = 8 TeV. The cross sections times branching fractions (er x B) used for the normalisation o 


some processes (many of these are subsequently normalised to data) are included in the last column 
together with the perturbative order of the QCD calculation. For the signal processes the H — > tt 
SM branching ratio is included, and for the W and Z /y* background processes the branching ratios 
for leptonic decays (£ = e, /i, r) of the bosons are included. For all other background processes, 
inclusive cross sections are quoted (marked with a f). 


tion consistent with the beam spot. It has been verified that, after object selection cuts, 
contributions from other primary vertices are negligible. 

With respect to the object identification requirements described in section 2, tighter 
criteria are applied to address the different background contributions and compositions 
in the different analysis channels. Higher p t thresholds are applied to electrons, muons, 
and T} ia q candidates according to the trigger conditions satisfied by the event, as listed in 
table 2. For the channels involving leptonic tau decays, Ti ep Ti ep and Ti ep 7h a d, additional 
isolation criteria for electrons and muons, based on tracking and calorimeter information, 
are used to suppress the background from misidentified jets or from semileptonic decays 
of charm and bottom hadrons. The calorimeter isolation variable I(Eq',AR) is defined 
as the sum of the total transverse energy in the calorimeter in a cone of size A R around 
the electron cluster or the muon track, divided by the F/p of the electron cluster or the 
Pt of the muon respectively. The track-based isolation J(pt,A.R) is defined as the sum 
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= 7 TeV 

Trigger 

Trigger 

level 

thresholds, 
p T [GeV] 

Analysis level thresholds [GeV] 

TlepTlep TlepThad ThadThad 

Single electron 

20-22 

Pt > 22 - 24 p e T > 25 

p% > 10 6r: p t t > 20 

Single muon 

18 

p :> 20 

p%? >10 p£ > 22 

pti > 20 fJ ' T ' Pt > 20 

p e T > 15 

Di-electron 

12/12 

Pt > 15 

6e: Pt 2 > 15 

Di-Thad 

29/20 

pTy > 35 
TT: Pt 2 > 25 

y/s = 8 TeV 

Trigger 

Trigger 

level 

thresholds, 
p T [GeV] 

Analysis level thresholds [GeV] 

TlepTlep TlepThad ThadThad 

Single electron 

24 

Pt > 26 

p£ > 10 Pt > 26 

Pt 1 > 26 6r: p7 > 20 

Pt 2 > 15 

Single muon 

24 

p£ > 26 

Mr: p7 > 20 

Di-electron 

12/12 

Pt > 15 

6e: Pt 2 > 15 

Di-muon 

18/8 

pS. 1 > 20 

p- > 10 

Electron+muon 

12/8 

p e T > 15 
e/i: p!f, > 10 

Di-Thad 

29/20 

Pt > 35 
TT: Pt 2 > 25 


Table 2. Summary of the triggers used to select events for the different analysis channels at the 
two centre-of-mass energies. The transverse momentum thresholds applied at trigger level and in 
the analysis are listed. When more than one trigger is used, a logical OR is taken and the trigger 
efficiencies are calculated accordingly. 


of the transverse momenta of tracks within a cone of A R around the electron or muon 
track, divided by the Ep of the electron cluster or the muon pp respectively. The isolation 
requirements applied are slightly different for the two centre-of-mass energies and are listed 
in table 3. 

In the t had7"had channel, isolated taus are selected by requiring that there are no tracks 
with px > 0.5 GeV in an isolation region of 0.2 < A R < 0.6 around the tau direction. 
This requirement leads to a 12% (4%) efficiency loss for hadronic taus, while 30% (10%) of 
contamination from jets is rejected in 8 (7) TeV data. 
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^lepTlep 

"Hep Thad 

7 TeV 

Electrons 

8 TeV 

I(px, 0.4) < 0.06 
I(E T , 0.2) < 0.08 

I(p T , 0.4) < 0.06 

/(£x, 0.2) < 0.06 

/(px, 0.4) < 0.17 
I{E t , 0.2) < 0.09 

J(p T , 0.4) < 0.06 
J(-E t , 0.2) < 0.06 

7 TeV 

Muons 

8 TeV 

I(px, 0.4) < 0.06 
I(E t , 0.2) < 0.04 

/(p T , 0.4) < 0.06 
/(-E T , 0.2) < 0.06 

I(px, 0.4) < 0.18 
I{E t , 0.2) < 0.09 

/(p T , 0.4) < 0.06 
/(-E T , 0.2) < 0.06 


Table 3. Summary of isolation requirements applied for the selection of isolated electrons and 
muons at the two centre-of-mass energies. The isolation variables are defined in the text. 


After the basic lepton selection, further channel-dependent cuts are applied, as detailed 
in the following. The full event selection is summarised in table 4. 

The T[ e pTiep channel: exactly two isolated leptons with opposite-sign (OS) electric 
charges, passing the px threshold listed in table 2, are required. Events containing a 
Thad candidate are vetoed. For the Th a d candidates considered, the criteria used to reject 
electrons misidentified as Th a d candidates are tightened to a working-point of 85% signal 
efficiency [39]. 

In addition to the irreducible Z — > tt background, sizeable background contributions 
from Z —> ££ and from tt production are expected in this channel. Background contribu¬ 
tions from Z decays, but also from low mass resonances (charmonium and bottomonium), 
are rejected by requirements on the invariant mass m™ of the visible tau decay products, 
on the angle between the two leptons in the transverse plane and on E™ 1SS . To re¬ 

ject the large Z —> ££ contribution in events with same-flavour (SF) leptons (ee,pp), more 
stringent cuts on the visible mass and on E™ 1SS are applied for these events than for events 
with different-flavour (DF) leptons {ep). For SF final states, an additional variable named 
high-pT objects E™ 1SS (.E™ lss ’ HPTO ) is also used to reject background from Z/q* production. 
It is calculated from the high-px objects in the event, i.e. from the two leptons and from 
jets with px > 25 GeV. Due to the presence of real neutrinos, the two .E™ 1SS variables 
are strongly correlated for signal events but only loosely correlated for background from 
Z —> ee and Z —> pp decays. 

To further suppress background contributions from misidentified leptons 5 a minimum 
value of the scalar sum of the transverse momenta of the two leptons is required. Contribu¬ 
tions from tt events are further reduced by rejecting events with a 6 -jet with px > 25 GeV. 

Within the collinear approximation [99], i.e. assuming that the tau directions are given 
by the directions of the visible tau decay products and that the momenta of the neutri¬ 
nos constitute the missing transverse momentum, the tau momenta can be reconstructed. 

5 Misidentified leptons and Thad candidates are also referred to as “fake” in this paper. 
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For tau decays, the fractions of the tau momenta carried by the visible decay products , 6 
x Tj i = p v is,i/(Pvis,i +Pmis,i)> with i = 1 , 2 , are expected to lie in the interval 0 < x T j < 1 , 
and hence corresponding requirements are applied to further reject non-tau background 
contributions. 

Finally, to avoid overlap between this analysis and the search for H —> WW* —> Ivlv 
decays, the rr mass in the collinear approximation is required to satisfy the condition 
m coU > mz _ 25 Q e y 

The TiepThad channel: exactly one isolated lepton and one Th a d candidate with OS 
charges, passing the px thresholds listed in table 2, are required. The criteria used to 
reject electrons misidentified as Th a d are also tightened in this channel to a working-point 
of 85% signal efficiency [39]. 

The production of VF+jets and of top quarks constitute the dominant reducible back¬ 
ground in this channel. To substantially reduce the IF+jets contribution, a cut on the 
transverse mass" constructed from the lepton and the missing transverse momentum is ap¬ 
plied and events with mx > 70 GeV are rejected. Contributions from tt events are reduced 
by rejecting events with a 6 -jet with px > 30 GeV. 

The Th a dTh a d channel: one isolated medium r^ad candidate and one isolated tight Th a d 
candidate with OS charges are required. Events with electron or muon candidates are 
rejected. For all data, the missing transverse momentum must satisfy E!f lss > 20 GeV 
and its direction must either be between the two visible Thad candidates in (j) or within 
A (f> < 7t/4 of the nearest Th a d candidate. To further reduce the background from multijet 
production, additional cuts on the A R and pseudorapidity separation Ap between the two 
t h a d candidates are applied. 

With these selections, there is no overlap between the individual channels. 

4.2 Analysis categories 

To exploit signal-sensitive event topologies, two analysis categories are defined in an exclu¬ 
sive way. 

• The VBF category targets events with a Higgs boson produced via vector boson fusion 
and is characterised by the presence of two high-px jets with a large pseudorapidity 
separation (see table 4). The Ap(jiV 2 ) requirement is applied to the two highest- 
Px jets in the event. In the TiepThad channel, there is an additional requirement 
that > 40 GeV, to eliminate low-mass Z/ 7 * events. Although this category is 
dominated by VBF events, it also includes smaller contributions from ggF and VH 
production. 

• The boosted category targets events with a boosted Higgs boson produced via ggF. 
Higgs boson candidates are therefore required to have large transverse momentum, 

'’The variable p v i s is defined as the total momentum of the visible decay products of the tau lepton while 
p m is is defined as the momentum of the neutrino system reconstructed using the collinear approximation. 

7 rriT = V2 P l T ET aa ■ (1 — cos A cj>), where A <j> is the azimuthal separation between the directions of the 
lepton and the missing transverse momentum. 
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Channel 

Preselection cuts 

Tlep'Tep 

Exactly two isolated opposite-sign leptons 

Events with ri ia( j candidates are rejected 

30 GeV < to™ < 100 (75) GeV for DF (SF) events 
< 2.5 

E^ iss > 20 (40) GeV for DF (SF) events 

^miss.HPTO > 40 GeV for SF events 

pi% + p T 2 > 35 GeV 

Events with a 6-tagged jet with py > 25 GeV are rejected 

0.1 < x Tl , x T2 < 1 
m™ 11 > uiz — 25 GeV 

"Hep Thad 

Exactly one isolated lepton and one medium Th a d candidate with opposite charges 
to t < 70 GeV 

Events with a 6-tagged jet with py > 30 GeV are rejected 

TiadTiad 

One isolated medium and one isolated tight opposite-sign ri ia d-candidate 

Events with leptons are vetoed 

E^ iss > 20 GeV 

E™ lss points between the two visible taus in <p , or min[A0(r, E“ lss )] < 7 r/4 

0.8 < Ai?(T hadl ,T had2 ) < 2.4 

A^Thad^ThadJ < 1-5 

Channel 

VBF category selection cuts 

'HepT'lep 

At least two jets with p ^ > 40 GeV and p^ > 30 GeV 

Aj?(ji,j 2 ) > 2.2 

^lepTiad 

At least two jets with p ^ > 50 GeV and > 30 GeV 

A??(ji, j 2 ) > 3.0 
m™ > 40 GeV 

TiadTiad 

At least two jets with p^ 1 > 50 GeV and p^ 2 > 30 GeV 
p^? > 35 GeV for jets with p > 2.4 

Ap(ji, j 2 ) > 2.0 

Channel 

Boosted category selection cuts 

^lep^lep 

At least one jet with p T > 40 GeV 

All 

Failing the VBF selection 
p^ > 100 GeV 


Table 4. Summary of the event selection for the three analysis channels. The requirements used 
in both the preselection and for the definition of the analysis categories are given. The labels (1) 
and (2) refer to the leading (highest pr) and subleading final-state objects (leptons, Th a d, jets). The 
variables are defined in the text. 


p > 100 GeV. The p^ is reconstructed using the vector sum of the missing transverse 
momentum and the transverse momentum of the visible tau decay products. In the 
riepTiep channel, at least one jet with py > 40 GeV is required. The jet requirement 
selects a region of the phase space where the E!f lss of same-flavour events is well 
modelled by simulation. In order to define an orthogonal category, events passing 
the VBF category selection are not considered. This category also includes small 
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contributions from VBF and VH production. 

While these categories are conceptually identical across the three channels, differences 
in the dominant background contributions require different selection criteria. For both 
categories, the requirement on jets is inclusive and additional jets, apart from those passing 
the category requirements, are allowed. 

For the Thad'Thad channel, the so-called rest category is used as a control region. In this 
category, events passing the preselection requirements but not passing the VBF or boosted 
category selections are considered. This category is used to constrain the Z —> rr and 
multijet background contributions. The signal contamination in this category is negligible. 

4.3 Higgs boson candidate mass reconstruction 

The di-tau invariant mass is reconstructed using the missing mass calculator 

(MMC) [100]. This requires solving an underconstrained system of equations for six to 
eight unknowns, depending on the number of neutrinos in the rr final state. These un¬ 
knowns include the x-, y-. and ^-components of the momentum carried by the neutrinos 
for each of the two tau leptons in the event, and the invariant mass of the two neutrinos 
from any leptonic tau decays. The calculation uses the constraints from the measured x- 
and y-components of the missing transverse momentum, and the visible masses of both 
tau candidates. A scan is performed over the two components of the missing transverse 
momentum vector and the yet undetermined variables. Each scan point is weighted by its 
probability according to the E™ ms resolution and the tau decay topologies. The estimator 
for the rr mass is defined as the most probable value of the scan points. 

The MMC algorithm provides a solution for ~99% of the H —> rr and Z —> rr 
events. This is a distinct advantage compared to the mass calculation using the collinear 
approximation where the failure rate is higher due to the implicit collinearity assumptions. 
The small loss rate of about 1% for signal events is due to large fluctuations of the i?™ lss 
measurement or other scan variables. 

Figure 1 shows reconstructed mass distributions for H —> rr and Z —> rr 

events in the ri ep rh a d VBF and boosted categories. The mass resolution, defined as the 
ratio between the full width at half maximum (FWHM) and the peak value of the mass 
distribution (m pea k)) is found to be ~ 30% for all categories and channels. 

5 Boosted decision trees 

Boosted decision trees are used in each category to extract the Higgs boson signal from the 
large number of background events. Decision trees [24] recursively partition the parameter 
space into multiple regions where signal or background purities are enhanced. Boosting is 
a method which improves the performance and stability of decision trees and involves the 
combination of many trees into a single final discriminant [25, 26]. After boosting, the final 
score undergoes a transformation to map the scores on the interval —1 to +1. The most 
signal-like events have scores near 1 while the most background-like events have scores near 
- 1 . 
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< MC [GeV] 

(a) 



m T M T MC [GeV] 
(b) 


Figure 1. The reconstructed invariant rr mass, m^. MC for H —> rr (run = 125 GeV) and 
Z —^ tt events in MC simulation and embedding respectively, for events passing (a) the VBF 
category selection and (b) the boosted category selection in the Ti ep Th a d channel. 


Separate BDTs are trained for each analysis category and channel with signal and 
background samples, described in section 6, at y/s = 8 TeV. They are then applied to 
the analysis of the data at both centre-of-mass energies. The separate training naturally 
exploits differences in event kinematics between different Higgs boson production modes. 
It also allows different discriminating variables to be used to address the different back¬ 
ground compositions in each channel. A large set of potential variables was investigated, in 
each channel separately, and only those variables which led to an improved discrimination 
performance of the BDT were kept. For the training in the VBF category, only a VBF 
Higgs production signal sample is used, while training in the boosted category uses ggF, 
VBF, and VH signal samples. The Higgs boson mass is chosen to be = 125 GeV for 
all signal samples. The BDT input variables used at both centre-of-mass energies are listed 
in table 5. Most of these variables have straightforward definitions, and the more complex 
ones are defined in the following. 

• AR(ti,T 2 ): the distance A R between the two leptons, between the lepton and Ti iac j, 
or between the two ri iac j candidates, depending on the decay mode. 

• p^ otal : magnitude of the vector sum of the transverse momenta of the visible tau 
decay products, the two leading jets, and E!f lss . 

• Sum scalar sum of the pt of the visible components of the tau decay products 
and of the jets. 

• E!f lss (f> centrality: a variable that quantifies the relative angular position of the missing 
transverse momentum with respect to the visible tau decay products in the transverse 
plane. The transverse plane is transformed such that the direction of the tau decay 
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products are orthogonal, and that the smaller <j> angle between the tau decay prod¬ 
ucts defines the positive quadrant of the transformed plane. The E!f lss (/) centrality 
is defined as the sum of the x- and y-components of the E™ ms unit vector in this 
transformed plane. 


Sphericity: a variable that describes the isotropy of the energy flow in the event [101]. 
It is based on the quadratic momentum tensor 

'EiPfPi 


Gj&fi _ 


(5.1) 


EJplT 

In this equation, a and /3 are the indices of the tensor. The summation is performed 
over the momenta of the selected leptons and jets in the event. The sphericity of the 
event (S') is then defined in terms of the two smallest eigenvalues of this tensor, A 2 
and A 3 , 

S = - (A 2 + A 3 ). (5.2) 

min(A% 1 ^ 2 j e t s ): the minimum A?? between the dilepton system and either of the two 
jets. 


• Object 7 ? centrality: a variable that quantifies the rj position of an object (an isolated 
lepton, a Th ac j candidate or a jet) with respect to the two leading jets in the event. It 
is defined as 


C m , m (p) = ex P 


-4 


P 


Pi + P2 


(5.3) 


(hi -P 2) 2 

where 77 , 771 and 772 are the pseudorapidities of the object and the two leading jets 
respectively. This variable has a value of 1 when the object is halfway in 77 between 
the two jets, 1 /e when the object is aligned with one of the jets, and < 1 /e when the 
object is not between the jets in 77 . In the Ti ep ri ep channel the 77 centrality of a third 
jet in the event, , 772 (?7j 3 ), an d the product of the 77 centralities of the two leptons 
are used as BDT input variables, while in the Ti ep Th a d channel the 77 centrality of 
the lepton, C Vljri2 (p/), is used, and in the Th a dTh a d channel the 77 centrality of each r, 
C m)7l2 (p T1 ) and C m i??2 {p T2 ) , is used. Events with only two jets are assigned a dummy 
value of —0.5 for C rn , m {p TA )• 


Among these variables the most discriminating ones include , AR{t\,T 2 ) and 

At? (71 ,7/2)- Figures 2 and 3 show the distributions of selected BDT input variables. For 
the VBF category, the distributions of Ar](ji,j 2 ) and centrality are shown for all three 
channels. For the boosted category, the distributions of Ai?(Ti,r 2 ) and E™ lss cj) centrality 
are shown for the TiepT^d and ThadThad channels, and the distribution of the px °f the 
leading jet and the sphericity are shown for the ri ep ri ep channel. For all distributions, the 
data are compared to the predicted SM backgrounds at yfs = 8 TeV. The corresponding 
uncertainties are indicated by the shaded bands. All input distributions are well described, 
giving confidence that the background models (from simulation and data) describe well the 
relevant input variables of the BDT. Similarly, good agreement is found for the distributions 
at y/s — 7 TeV. 
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Variable 

VBF 

TlepBep BepThad 7 had 7 "had 

Boosted 

'TLepTlep BepThad 'Hiad^had 

W MMC 

• • • 

• • • 

A R(ti,t 2 ) 

• • • 

• • 

Ar/(ji,j 2 ) 

• • • 


m jl,j 2 

• • • 


Vh X Vj2 

• • 


Total 

P T 

• • 


Sum px 


• • 

Pt/Pt 


• • 

E™ lss rf> centrality 

• • 

• • • 

m W,h 


• 



• 

A</>(4,^) 


• 

Sphericity 


• 



• 

p’t 


• 

E^/p^ 


• 

r?iT 

• 

• 

min(A% 1 ^ ! j ets ) 

• 


i) ' C»?i, 772 (%) 

• 


C»?i,r/2 iVi) 

• 


Crn ,n 2 ( 7 7j3) 

• 


C'?l ,»?2 (Vti ) 

• 


Cm ,m (jlm ) 

• 



Table 5. Discriminating variables used in the training of the BDT for each channel and category 
at y/s = 8 TeV. The more complex variables are described in the text. The filled circles indicate 
which variables are used in each case. 
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Figure 2. Distributions of important BDT input variables for the three channels and the two 
categories (VBF, left) and (boosted, right) for data collected at = 8 TeV. The distributions 
are shown for (a) the separation in pseudorapidity of the jets, Arj(j \, j' 2 ), and (b) the transverse 
momentum of the leading jet p ^ in the Ti ep Ti ep channel, for (c) Ar;(ji,j 2 ) and (d) AR(t\,T 2 ), 
the distance A R between the lepton and r^ad) in the Ti ep Th a d channel and for (e) Arj(ji,j 2 ) and 
(f) AR(ti,t 2 ), the distance A R between the two Th a d candidates, in the Thadihad channel. The 
contributions from a Standard Model Higgs boson with mn = 125 GeV are superimposed, multiplied 
by a factor of 50. These figures use background predictions made without the global fit defined in 
section 8. The error band includes statistical and pre-fit systematic uncertainties. 
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Figure 3. Distributions of important BDT input variables for the three channels and the two 
categories (VBF, left) and (boosted, right) for data collected at V s = 8 TeV. The distributions are 
shown for (a) the product of the lepton centralities, C miJ?2 (%) • CV )I72 (% 2 )) and (b) the sphericity 
in the Ti ep Ti e p channel, for (c) the centrality of the lepton, C m ^ 2 {rie), and (d) the E^ lss <p centrality 
in the Ti ep Tj la d channel, and for (e) the centrality of the subleading tau, C rn )J)2 (r/ T2 ), and (f) the 
E^ lss <p centrality in the ihadihad channel. The contributions from a Standard Model Higgs boson 
with rriH = 125 GeV are superimposed, multiplied by a factor of 50. These figures use background 
predictions made without the global fit defined in section 8. The error band includes statistical and 
pre-fit systematic uncertainties. 
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6 Background estimation 


The different final-state topologies of the three analysis channels have different background 
compositions which necessitate different strategies for the background estimation. In gen¬ 
eral, the number of expected background events and the associated kinematic distributions 
are derived from a mixture of data-driven methods and simulation. The normalisation 
of several important background contributions is performed by comparing the simulated 
samples of individual background sources to data in regions which only have a small or 
negligible contamination from signal or other background events. The control regions used 
in the analysis are summarised in table 6. 

Common to all channels is the dominant Z —> tt background, for which the kinematic 
distributions are taken from data by employing the embedding technique, as described 
in section 3. Background contributions from jets that are misidentified as hadronically 
decaying taus (fake backgrounds) are estimated by using either a fake-factor method or 
samples of non-isolated Th a d candidates. Likewise, samples of non-isolated leptons are used 
to estimate fake-lepton contributions from jets or hadronically decaying taus and leptons 
from other sources, such as heavy-quark decays. 8 

Contributions from various other physics processes with leptons and/or Th a d candidates 
in the final state are estimated using the simulation, normalised to the theoretical cross 
sections, as given in table 1. A more detailed discussion of the estimation of the various 
background components in the different channels is given in the following. 

6.1 Background from Z —> tt production 

A reliable modelling of the irreducible Z —>• tt background is an important ingredient of 
the analysis. It has been shown in other ATLAS analyses that existing Z+jets Monte Carlo 
simulation needs to be reweighted to model data correctly [102-104]. Additionally, it is not 
possible to select a sufficiently pure and signal-free Z —>• tt control sample from data to 
model the background in the signal region. Therefore this background is estimated using 
embedded data, as described in section 3. This procedure was extensively validated using 
both data and simulation. To validate the subtraction procedure of the muon cell energies 
and tracks from data and the subsequent embedding of the corresponding information from 
simulation, the muons in Z — > pp events are replaced by simulated muons. The calorimeter 
isolation energy in a cone of A R — 0.3 around the muons from data before and after 
embedding is compared in figure 4(a). Good agreement is found, which indicates that no 
deterioration (e.g. possible energy biases) in the muon environment is introduced. Another 
important test validates the embedding of more complex Z —>• tt events, which can only be 
performed in the simulation. To achieve a meaningful validation, the same MC generator 
with identical settings was used to simulate both the Z —> pp and Z —>• tt events. The 
sample of embedded events is corrected for the bias due to the trigger, reconstruction and 
acceptance of the original muons. These corrections are determined from data as a function 
of pi p and rj(p), and allow the acceptance of the original selection to be corrected. The tau 
decay products are treated like any other objects obtained from the simulation, with one 

8 For simplicity, leptons from heavy-quark decays are considered as fake leptons in the following. 
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important difference due to the absence of trigger simulation in this sample. Trigger effects 
are parameterised from the simulation as a function of the tau decay product pt- After 
replacing the muons with simulated taus, kinematic distributions of the embedded sample 
can be directly compared to the fully simulated ones. As an example, the reconstructed 
invariant mass, is shown in figure 4(b), for the TiepThad final state. Good agreement 

is found and the observed differences are covered by the systematic uncertainties. Similarly, 
good agreement is found for other variables, such as the missing transverse momentum, the 
kinematic variables of the hadronically decaying tau lepton or of the associated jets in the 
event. A direct comparison of the Z —> tt background in data and the modelling using the 
embedding technique also shows good agreement. This can be seen in several kinematic 
quantity distributions, which are dominated by Z —> tt events, shown in figure 2. 

The normalisation of this background process is taken from the final fit described in 
section 8. The normalisation is independent for the ri ep Ti e p, Ti ep Th a d, and 7had T had analysis 
channels. 
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Figure 4. (a) The distribution of the calorimeter isolation energy I(Et, 0.3) ■ pl^ within a cone of 

radius A R = 0.3 around the muons in Z —> pp, events from data, before and after the embedding 
of simulated muons, (b) The distribution of the reconstructed invariant tt mass, m)^ MC , in the 
Tiepihad final state, for simulated Z —>■ tt events, compared to the one obtained from simulated 
Z —)• pp events after tau embedding. The ratios of the values before and after the embedding 
and between the embedded Z —> pp and Z — ► tt events are given in (a) and (b) respectively. 
The errors in (a) and (b) on the ratios (points) represent the statistical uncertainties, while the 
systematic uncertainties are indicated by the hatched bands in (b). The shaded bands represent 
the statistical uncertainties from the Z —> pp data events in (a) and from the Z — ► tt simulation 
in (b). 
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6.2 Background from misidentified leptons or hadronically decaying taus 

For the Ti ep Ti ep channel, all background sources resulting from misidentified leptons are 
treated together. In this approach, contributions from multijet and IF+jets production, 
as well as the part of the tt background resulting from decays to leptons and hadrons 
(tt —> tvb qqb) are included. A control sample is defined in data by inverting the isolation 
requirements for one of the two leptons, while applying all other signal region requirements. 
The contributions from other background channels (dileptonic tt decays (tt —> Ivb Ivb ), 
Z — > ee, Z — > /ip, and diboson production) are obtained from the simulation and are 
subtracted. From this control sample a template is created. The normalisation factor is 
obtained by fitting the px distribution of the subleading lepton at an early stage of the 
preselection. 

For the Ti ep Th a d channel, the fake-factor method is used to derive estimates for the 
multijet, IF+jets, Z+jets, and semileptonic tt background events that pass the Ti ep Th a d 
selection due to a misidentified ri iac j candidate. The fake factor is defined as the ratio of the 
number of jets identified as medium Th a d candidates to the number satisfying the loose, but 
not the medium, criteria. Since the fake factor depends on the type of parton initiating the 
jet and on the px of the jet) it is determined as a function of px separately for samples en¬ 
riched in quark- and gluon-initiated jets. In addition, the fake factor is found to be different 
for 1-track and 3-track candidates. Three different, quark-jet dominated samples are used 
separately for the IF+jets, tt and Z+jets background components. They are defined by se¬ 
lecting the high-mx region (mx > 70 GeV), by inverting the 6-jet veto and by requiring two 
leptons with an invariant mass consistent with mz (80 GeV < mu < 100 GeV) respectively. 
In addition, a multijet sample dominated by gluon-initiated jets is selected by relaxing the 
lepton identification and requiring it to satisfy the loose identification criteria. The derived 
fake factors are found to vary from 0.124 (0.082) for px = 20 GeV to 0.088 (0.038) for 
px — 150 GeV for 1-track (3-track) candidates in the VBF category. The corresponding 
values for the boosted category are 0.146 (0.084) for px — 20 GeV and 0.057 (0.033) for 
px — 150 GeV. To obtain the fake-background estimate for the VBF and boosted signal 
regions, these factors are then applied, weighted by the expected relative IF+jets, Z+jets, 
multijet, and tt fractions, to the events in regions defined by applying the selections of the 
corresponding signal region, except that the Thad candidate is required to pass the loose and 
to fail the medium Th a d identification. As an example, the good agreement between data 
and background estimates is shown in figure 5(a) for the reconstructed tt mass for events 
in the high-mx region, which is dominated by IF+jets production. 

For the Thad+iad channel, the multijet background is modelled using a template ex¬ 
tracted from data that pass the VBF or boosted category selection, where, however, the 
taus fail the isolation and opposite-sign charge requirements (the number-of-tracks require¬ 
ment is not enforced). The normalisation of the multijet background is first determined by 
performing a simultaneous fit of the multijet (modelled by the data sample just mentioned) 
and Z — y tt (modelled by embedding) templates after the preselection cuts. The fit is per¬ 
formed for the distribution of the difference in pseudorapidity between the two hadronic tau 
candidates, Ar/(Th a d, Th a d)- The signal contribution is expected to be small in this category. 
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The agreement between data and the background estimate for this distribution is shown 
in figure 5(b) for the rest category defined in section 4. The preselection normalisation is 
used as a reference point and starting value for the global fit (see below) and is used for 
validation plots. The final normalisations of the two important background components, 
from multijet and Z —> rr events, are extracted from the final global fit, as described in 
section 8 , in which the A^r^ad, 7h a d) distribution for the rest category is included. 


> 

CL) 

o 

o 

C\J 

£ 

c 

CL) 

> 

LU 


240f- 

220- X|ep ‘ had 

{s = 8 TeV, 20.3 fb 1 


, VBF 


2001 

1801 


W+jets 

Control Region 


-T | I 1 ! 

ATLAS 

• Data 

— 50 x H( 125) 

Z—> XT 

■ ff+single-top 

■ Others 

■ Fake x 
^ Uncert. 



200 300 

m T M x MC [GeV] 


(a) 



Figure 5. (a) The distribution of the reconstructed invariant rr mass, m^. MC , for events in 

the IT+jets control region, for the ri ep rh a d channel, (b) The separation in pseudorapidity of the 
fhad candidates, Ar;(rhadiihad)) f° r the 'Tiad'thad channel in the rest control region. The expected 
SM Higgs boson signal contribution is superimposed, multiplied by a factor 50. These figures use 
background predictions made without the global fit defined in section 8. The error band includes 
statistical and pre-fit systematic uncertainties. 


6.3 Z —> ee and Z —> /i/i background 

The Drell-Yan Z/ 7 * —> ee and Z/ 7 * —F /i/J background channels are important contribu¬ 
tions to the final states with two same-flavour leptons. They also contribute to the other 
channels. As described below, a simulation based on ALPGEN is used to estimate these 
background sources. Correction factors are applied to account for differences between data 
and simulation. 

In the TiepTiep channel, the ALPGEN simulation is normalised to the data in the Z -mass 
control region, 80 GeV < mn> < 100 GeV, for each category, and separately for Z —> ee and 
Z —> /i/u events. The normalisation factors are determined from the final fit described in 
section 8 . The distribution of the reconstructed rr mass for events in this control region is 
shown in figure 6 (a). 
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In the t i e p7had channel, the Z —> ee and Z —> background estimates are also based 
on simulation. The corrections applied for a Th a d candidate depend on whether it originates 
from a lepton from the Z boson decay or from a jet. In the first case, corrections from data, 
derived from dedicated tag-and-probe studies, are applied to account for the difference in 
the rate of misidentified Th ac j candidates resulting from leptons [21, 105]. This is particularly 
important for Z —> ee events with a misidentified 7h a d candidate originating from a true 
electron. In the second case, the fake-factor method described in section 6.2 is applied. 

In the t hadhad channel, the contribution of this background is very small and is taken 
from simulation. 
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Figure 6. (a) The distribution of the reconstructed invariant rr mass, ?u^. MC , for events in 

the Z —> U control region, for the Ti ep Ti ep channel, (b) The distribution of the separation in 
pseudorapidity of the two leading jets, A? 7 (ji,j 2 ), for events in the top control region, for the 
t tepThad channel. This figure uses background predictions made without the global fit defined in 
section 8. The error band includes statistical and pre-fit systematic uncertainties. 


6.4 W+jets background 

Events with W bosons and jets constitute a background to all channels since leptonic W 
decays can feed into all signatures when the true lepton is accompanied by a jet which 
is falsely identified as a Th a d or a lepton candidate. This process can also contribute via 
semileptonic heavy quark decays that provide identified leptons. 

As stated in section 6.2, for the ri cp ri ep and Ti ep Th a d channels, the TT+jets contribu¬ 
tions are determined with data-driven methods. For the Th a dTh a d channel, the W —> Th a d^ 
background is estimated from simulation. A correction is applied to account for differences 
in the Th a d misidentification probability between data and simulation. 
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6.5 Background from top-quark production 

Background contributions from tt and single top-quark production, where leptons or hadron¬ 
ic ally decaying taus appear in decays of top quarks, are estimated from simulation in the 
T lep' r lep and Ti ep Th a d channels. The normalisation is obtained from data control regions de¬ 
fined by requiring a 6-jet instead of a 6-veto. In the Ti cp Th a d channel, a large value of the 
transverse mass tot is also required, to enhance the background from top-quark production 
and to suppress the signal contribution. This background is also found to be small for the 
7had T had channel and it is estimated using simulation. The distribution of Ar/(ji, j’ 2 ) for 
events in the top control region, for the ri ep Th a d channel, is shown in figure 6 (b). 

6.6 Diboson background 

The production of pairs of vector bosons (W + W~, ZZ and W^Z), with subsequent decays 
to leptons or jets, contributes especially to the background in the Ti ep Ti ep channel. For 
all analysis channels, these contributions are estimated from simulation, normalised to the 
NLO cross sections indicated in table 1. 

6.7 Contributions from other Higgs boson decays 

In the TiepTiep channel, a non-negligible contribution from H —> WW —> Ivlv exists and 
this process is considered as background. Its contribution is estimated for run = 125 GeV 
using simulation. The corresponding signal cross section is assumed to be the SM value 
and is indicated in table 1. 

6.8 Validation of background estimates 

As described above, the normalisation for important background sources that are modelled 
with simulation are determined by fitting to data in control regions. These normalisations 
are compared in table 7 to predictions based on the theoretical cross sections for the 8 TeV 
analysis. In most cases, the values obtained are compatible with unity within the statistical 
uncertainties shown. For the top control region in the VBF category of the Ti ep Th a d channel, 
the value is also in agreement with unity if the experimental and theoretical systematic 
uncertainties are included. The control-region normalisations are used for validation plots, 
and they are used as starting values in the final global fit described in section 8. The global 
fit does not change any of these normalisations by more than 2%. 

It is important to verify that the BDT output distributions in data control regions are 
well described after the various background determinations. Figure 7 shows distributions 
from important control regions for the y/s = 8 TeV dataset, i.e. the Z-enriched control 
regions for the ri ep ri ep and Ti ep Th a d channels, and the reconstructed tt invariant mass side¬ 
band control region (defined as m^ MC < 100 GeV or to^. mc > 150 GeV) for the ThadThad 
channel. The distributions are shown for both the VBF and the boosted categories. All 
distributions are found to be well described, within the systematic uncertainties. 
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Channel 

Background 

Scale factors (CR) 

VBF Boosted 

T’lep'Hep 

Top 

Z — y ee 

Z n/i 

0.99 ± 0.07 1.01 ± 0.05 

0.91 ± 0.16 0.98 ± 0.10 

0.97 ± 0.13 0.96 ± 0.08 

T’lep’T’had 

Top 

0.84 ± 0.08 0.96 ± 0.04 


Table 7. The scale factors calculated in control regions (CR) for background normalisation. Only 
the statistical uncertainties are given. The background contributions shown in validation plots use 
the normalisation predicted from simulation multiplied by the corresponding scale factor. Final 
normalisations are taken from the global fit, described in section 8. 
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Figure 7. Distributions of the BDT output for data collected at yfs = 8 TeV, compared to the 
expected background contributions in important control regions. The distributions are shown for 
the VBF (left) and boosted (right) categories (a,b) for the Z —> rr-enriched control region in the 
TiepTiep channel, (c,d) for the Z —> rr-enriched control region in the ri ep rh a d channel and (e,f) for 
the rr invariant mass sideband control region in the ThadThad channel. The contributions from a 
Standard Model Higgs boson with run = 125 GeV are superimposed, multiplied by a factor of 50. 
These figures use background predictions made without the global fit defined in section 8. The 
error band includes statistical and pre-fit systematic uncertainties. 
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7 Systematic uncertainties 


The numbers of expected signal and background events, the input variables to the BDT, 
and thereby the BDT output and the final discrimination between signal and background 
are affected by systematic uncertainties. They are discussed below, grouped into three 
categories: experimental uncertainties, background modelling uncertainties, and theoretical 
uncertainties. For all uncertainties, the effects on both the total signal and background 
yields and on the shape of the BDT output distribution are evaluated. Table 8 gives a 
summary of the systematic uncertainties and their impact on the number of expected events 
for the signal and the total background for the analysis of the data taken at y/s — 8 TeV. 
The dominant sources that affect the shape of the BDT output distribution are marked 
in the table. All uncertainties are treated either as fully correlated or uncorrelated across 
channels. The latter are also marked in table 8. The effects of the systematic uncertainties 
at yfs — 7 TeV are found to be similar and are not discussed here. The inclusion of the 
uncertainties in the profile likelihood global fit is described in section 8 and the effect of 
the most significant systematic uncertainties is presented in table 13. 

7.1 Experimental uncertainties 

The major experimental systematic uncertainties result from uncertainties on efficiencies 
for triggering, object reconstruction and identification, as well as from uncertainties on the 
energy scale and resolution of jets, hadronically decaying taus and leptons. In general, the 
effects resulting from lepton-related uncertainties are smaller than those from jets and taus. 
They are not discussed in detail, however, their impact is included in table 8. In addition, 
uncertainties on the luminosity affect the number of signal and background events from 
simulation. 

• Luminosity: the uncertainty on the integrated luminosity is ±2.8% for the 8 TeV 
dataset and ±1.8% for the 7 TeV dataset. It is determined from a calibration of 
the luminosity scale derived from beam-separation scans performed in 2011 and 2012 
using the method described in ref. [106]. 

• Efficiencies: the efficiencies for triggering, reconstructing and identifying electrons, 
muons, and Thad candidates are measured in data using tag-and-probe techniques. 
The uncertainties on the Thad identification efficiency are ±(2-3)% for 1-prong and 
±(3-5)% for 3-prong tau decays [39]. The 6-jet tagging efficiency has been measured 
from data using tt events, where both top quarks decay to leptons, with a total 
uncertainty of about ±2% for jets with transverse momenta up to 100 GeV [38, 107]. 
The MC samples used are corrected for differences in these efficiencies between data 
and simulation and the associated uncertainties are propagated through the analysis. 

• Energy scales: the uncertainties on the jet energy scale (JES) arise from several 
sources. These include, among others, varied response due to the jet flavour com¬ 
position (quark- versus gluon-initiated jets), pile-up, r/ intercalibration, and detector 
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response and modelling of in situ jet calibration [35, 36]. The impact of the JES un¬ 
certainty in this analysis is reduced because many of the background components are 
estimated using data. The tau energy scale is obtained by fitting the reconstructed 
visible mass for Z —> rr events in data, which can be selected with a satisfactory 
purity. It is measured with a precision of ±(2-4)% [108]. Since systematic uncer¬ 
tainties on the energy scales of all objects affect the reconstructed missing transverse 
momentum, it is recalculated after each variation is applied. The scale uncertainty 
on E™ 1SS due to the energy in calorimeter cells not associated with physics objects is 
also taken into account. 

• Energy resolutions: systematic uncertainties on the energy resolution of taus, elec¬ 
trons, muons, jets, and f?™ lss affect the final discriminant. The effects resulting from 
uncertainties on the tau energy resolution are small. The impact of changes in the 
amount of material (inactive material in the detector, e.g. support structures), in the 
hadronic shower model and in the underlying-event tune were studied in the simula¬ 
tion. They result in systematic uncertainties below 1% on the tau energy resolution. 
The jet energy resolution is determined by in situ measurements, as described in 
ref. [109], and affects signal modelling and background components modelled by the 
simulation. The uncertainty of the resolution on E!f lss is estimated by evaluating the 
energy resolution of each of the E!f lss terms. The largest impact results from the soft 
term (see section 2), arising both from the MC modelling and the effects of pile-up. 
It is evaluated using simulated Z(—> /r/i)±jets events. 

7.2 Background modelling uncertainties 

The most significant systematic uncertainties on the background estimation techniques, as 
described in section 6, are detailed in the following for the three decay modes considered. 

In the TiepTjep channel, systematic uncertainties on the shape and normalisation of fake- 
lepton background sources are estimated by comparing samples of same-sign lepton events 
that pass and fail the lepton isolation criteria. These uncertainties amount to ±33% (±20%) 
at 8 TeV and ±10.5% (±13%) at 7 TeV for the boosted (VBF) category. The extrapolation 
uncertainty for the Z —> it background is obtained by varying the mu window that defines 
the control region for this background, and amounts to about ±6%. The corresponding 
extrapolation uncertainty for top-quark background sources is ±(3-6)%, obtained from 
the differences in event yields in the top-quark control regions when using different MC 
generators. Neither of these extrapolation uncertainties is significant for the final result. 
The dominant uncertainties on the normalisation of the tt background, obtained from the 
global fit, are the systematic uncertainties on the 6-jet tagging efficiency and the jet energy 
scale. 

In the TjepThad channel, an important systematic uncertainty on the background deter¬ 
mination comes from the estimated fake background, for which several sources of systematic 
uncertainty are considered. The statistical uncertainty on the effective fake factor is ±4.3% 
(±2.3%) in the 8 TeV VBF (boosted) category, and about ±22% (±11%) in the 7 TeV VBF 
(boosted) category. The dominant systematic uncertainty on the methodology itself arises 
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from the composition of the combined fake background (W±jets, Z+jets, multijet, and 
tt fractions), which is largely estimated based on simulated event samples as explained in 
section 6.2. The uncertainty is estimated by varying each fractional contribution by ±50%, 
which affects the effective fake factor by ±3% (±6%) and by ±10% (±15%) in the 8 TeV 
and 7 TeV boosted (VBF) categories respectively. As a closure test, the method was also 
applied in a region of data where the lepton and Th ac j candidate have the same charge, rich 
in fake Th a( j candidates. Very good agreement was observed between data and the method’s 
prediction, so that no additional in situ uncertainty was deemed necessary. In addition, the 
uncertainties on the normalisation of the tt background are important. As in the case of 
the t iepT| ep channel, the dominant contribution obtained from the global fit originates from 
systematic uncertainties on the 6-jet tagging efficiency and the jet energy scale, along with 
statistical uncertainties on the observed data in the respective control regions. 

In the t h a d7"had channel, the major background from multijet production is determined 
using a data-driven template method. The default multijet template, derived from a sample 
in data where the Thad candidates fail the isolation and opposite-sign charge requirements, is 
compared with an alternative template derived from a sample where the Th a d candidates fail 
just the opposite-sign charge requirement. The normalisation of the alternative template 
is fixed to that of the default template at preselection; the alternative multijet template is 
propagated into the various categories and gives a different set of yields from the default 
template. This difference, along with the difference in shape between the two templates, 
constitutes the systematic uncertainty on the background estimate. This leads to an overall 
multijet yield variation of 10% (3%) in the VBF (boosted) category at y^^STeV and of 
10% (30%) in the VBF (boosted) category y / s = 7TeV. However, there is a very strong 
shape dependence, such that the uncertainties on the BDT output are much larger at 
higher output values. 

For the embedding method used in all channels, the major systematic uncertainties are 
related to the selection of Z —> /x/x events in data and to the subtraction of the muon energy 
deposits in the calorimeters. The selection uncertainties are estimated by varying the muon 
isolation criteria in the selection from the nominal value of < 0.2 (see section 4) 

to tighter (J(pt> 0.4) < 0.06 and I(Et, 0.2) < 0.04) and looser (no isolation requirements) 
values. The muon-related cell energies to be subtracted are varied within ±20% (±30%) for 
the 8 TeV (7 TeV) data. In addition, systematic uncertainties on the corrections for trigger 
and reconstruction efficiencies are taken into account. Due to the combination of single¬ 
lepton and dilepton triggers used, the uncertainties are largest for the Ti ep Ti ep channel. All 
experimental systematic uncertainties relating to the embedded r decay products (such as 
tau energy scale or identification uncertainties) are applied normally. The combined effect 
of all uncertainties on the signal and background yields is included in table 8. Because the 
Z —y tt normalisation is determined in the final fit, the impact on the final result is much 
smaller. 

7.3 Theoretical uncertainties 

Theoretical uncertainties are estimated for the signal and for all background contributions 
modelled with the simulation. Since the major background contributions, from Z —>• tt 
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and misidentification of hadronically decaying r leptons, are estimated using data-driven 
methods, they are not affected by these uncertainties. Uncertainties on the signal cross 
sections are assigned from missing higher-order corrections, from uncertainties in the PDFs, 
and from uncertainties in the modelling of the underlying event. 

For VBF and VH Higgs boson production cross sections, the uncertainties due to miss¬ 
ing higher-order QCD corrections are estimated by varying the factorisation and renormal¬ 
isation scales by factors of two around the nominal scale mjy, as prescribed by the LHC 
Higgs Cross Section Working Group [110]. The resulting uncertainties range from ±2% to 
±4%, depending on the process and the category-specific selection considered. In addition, 
a 2% uncertainty related to the inclusion of the NLO EWK corrections (see section 3) is 
assigned. 

For Higgs boson production via ggF, the uncertainties on the cross sections due to 
missing higher order QCD corrections are estimated by varying the renormalisation and 



factorisation scales around the central values 


cross 


section calculations of H + 1-jet and H + 2-jet production. In the calculation of the un¬ 
certainties, appropriate cuts on the Higgs px (j?x > 100 GeV ) and on the jet kinematics 
(Ap,px) are applied at parton level for the boosted and VBF categories respectively. The 
resulting uncertainties on the ggF contributions are found to be about ±24% in the boosted 
category and ±23% in the VBF category. The ggF contribution is dominant in the boosted 
category, whereas it is only about 20% of the signal in the VBF category. Since the two cat¬ 
egories are exclusive, their anti-correlation is taken into account following the prescription 
in ref. [111]. 

In the present analysis, no explicit veto on jets is applied in the VBF selection, but 
enough kinematical information is provided as input to the BDT so that the high BDT- 
output region corresponds to a more exclusive region, where the probability of finding 
a third jet is reduced. Since the cross section for gluon-fusion events produced with a 
third jet is only known at LO, this could introduce a large uncertainty on the gluon-fusion 
contamination in the highest (and most sensitive) BDT-output bins. The uncertainty on the 
BDT shape of the ggF contribution is evaluated using the Mcfm Monte Carlo program [98], 
which calculates H + 3 jets at LO. Scale variations induce changes of the ggF contribution 
in the highest BDT bin of about ±30%. They are taken into account in the final fit. 

Uncertainties related to the simulation of the underlying event and parton shower are 
estimated by comparing the acceptance from Powheg±Pythia to Powheg±Herwig for 
both VBF and ggF Higgs boson production modes. Differences in the signal yields range 
from ±1% to ±8% for the VBF and from ±1% to ±9% for ggF production, depending 
on the channel and category. The BDT-score distribution of the PoWHEG±PYTHIA and 
Powheg±Herwig samples are compatible with each other within statistical uncertainties. 

The PDF uncertainties are estimated by studying the change in the acceptance when 
using different PDF sets or varying the CT10 PDF set within its uncertainties. The stan¬ 
dard VBF POWHEG sample and a MC@NLO [112] ggF sample, both generated with the 
CT10 PDFs, are reweighted to the MSTW2008NLO [113], NNPDF [114] and the CT10 
eigen-tunes parameterisation. The largest variation in acceptance for each category is used 
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as a constant PDF uncertainty; it varies between approximately ±4.5% and ±6% for ggF 
production and between about ±0.8% and ±1.0% for VBF production. A shape uncertainty 
is also included to cover any difference between the BDT score in the default sample and 
the reweighted ones. The uncertainty on the total cross section for the VBF, VH and ggF 
production modes due to the PDFs is also considered. 

Variations in the acceptance for different Monte Carlo generators are also included, 
comparing Powheg±F1erwig samples to MC@NLO±Herwig samples for ggF, and to 
aMC@NLO±Her,WIG [115] samples for VBF. The generator modelling uncertainty is 
around ±2% for ggF and ±4% for VBF productions modes. 

Finally, an uncertainty on the decay branching ratio, BR (H —> rr), of ±5.7% [70] 
affects the signal rates. 

The theoretical systematic uncertainties on the background predictions taken from 
the simulation are evaluated by applying the same procedures as used for the signal sam¬ 
ples. Uncertainties resulting from the choice of QCD scales, PDF parameterisation and 
underlying-event model are estimated. The results are reported in table 8. 
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Table 8. Impact of systematic uncertainties on the total signal, S, (sum of all production modes) and on the sum of all background estimates, 
B, for each of the three channels and the two signal categories for the analysis of the data taken at yfs = 8 TeV. Each systematic uncertainty 
is assumed to be correlated across the analysis channels, except those marked with a *. Uncertainties that affect the shape of the BDT-output 
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8 Signal extraction procedure 


The BDT output in the six analysis categories provides the final discrimination between 
signal and background for both the 7 and 8 TeV datasets. A maximum-likelihood fit is 
performed on all categories simultaneously to extract the signal strength, /x, defined as 
the ratio of the measured signal yield to the Standard Model expectation. The value 
/x = 0 (/x = 1) corresponds to the absence (presence) of a Higgs boson signal with the SM 
production cross section. The statistical analysis of the data employs a binned likelihood 
function £(//, 9), constructed as the product of Poisson probability terms, to estimate ^x. 

The impact of systematic uncertainties on the signal and background expectations is 
described by nuisance parameters, 8, which are each parameterised by a Gaussian or log¬ 
normal constraint. The expected numbers of signal and background events in each bin are 
functions of 6. The test statistic q^ is then constructed according to the profile likelihood 

ratio: q M = — 21n[£(/x, 9)/C(fi, 0)], where fi and 9 are the parameters that maximise the 

likelihood, and 9 are the nuisance parameter values that maximise the likelihood for a given 
/x. This test statistic is used to measure the compatibility of the background-only hypothesis 
with the observed data. 

The likelihood is maximised on the BDT distributions in the signal regions, with in¬ 
formation from control regions included to constrain background normalisations. The fit 
includes the event yields from the Z —> ££ and top control regions in the Ti e pTi e p channel, 
and from the top control region of the Ti e pTh a d channel; furthermore the Ax^Thad; "Hiad) 
distribution in the rest control region of the Thad'Thad channel is also included. 

The Z —> rr background is constrained primarily in the signal regions, due to the dif¬ 
ference between the BDT distributions for Z —> rr events and the signal. For the Thad^had 
channel, the Z —> rr and multijet background rates are constrained by the simultaneous fit 
of the two signal regions and the A^(rh a d' r had) distribution in the rest category control re¬ 
gion. The top and Z —> ll background components for the ri ep ri e p and Ti ep Th a d channels are 
also allowed to float freely, but are primarily constrained by the inclusion of the respective 
control regions. 

As described in section 7, a large number of systematic uncertainties, taken into ac¬ 
count via nuisance parameters, affect the final results. It is important to investigate the 
behaviour of the global fit and in particular to investigate how far the nuisance parameters 
are pulled away from their nominal values and how well their uncertainties are constrained. 
Furthermore, it is important to understand which systematic uncertainties have the most 
impact on the final result. For this purpose a ranking of nuisance parameters is introduced. 
For each parameter, the fit is performed again with the parameter fixed to its fitted value 
shifted up or down by its fitted uncertainty, with all the other parameters allowed to vary. 
The ranking obtained for those nuisance parameters contributing most to the uncertainty 
on the signal strength is shown in figure 8 for the combined fit of the three channels at the 
two centre-of-mass energies. The parameters contributing most are those related to the jet 
energy scale, the normalisation uncertainties for Z —> rr and top-quark events, and the 
tau energy scale. The uncertainties on the jet energy scale are decomposed into several 
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uncorrelated components (among others: r] intercalibration of different calorimeter regions, 
jet energy response, and response to jets of different flavour). In addition, theoretical un¬ 
certainties on the branching ratio BR (H —> rr) are found to have a significant impact. In 
general, good agreement is found between the pre-fit and post-fit values for these nuisance 
parameters and neither large pulls nor large constraints are observed. 

The distributions of the BDT discriminants for all channels and categories for the data 
at 8 TeV are shown in figure 9, with background normalisations, signal normalisation, and 
nuisance parameters adjusted by the profile likelihood global fit. 

The results for the numbers of fitted signal and background events, split into the various 
contributions, are summarised in tables 9, 10 and 11 for the three channels separately, for 
the dataset collected at 8 TeV centre-of-mass energy. In addition to the total number of 
events, the expected number of events in each of the two highest BDT output bins is given. 
The number of events observed in the data is also included. Within the uncertainties, 
good agreement is observed between the data and the model predictions for the sum of 
background components and a Standard Model Higgs boson with mu — 125 GeV. 


- 35 - 


Impact = A|ii/a tot 


-1 -0.5 0 0.5 1 



Pull = (0 - e o )/A0 


Figure 8. Impact of systematic uncertainties on the fitted signal-strength parameter fi for the 
combined fit for all channels and both centre-of-mass energies. The systematic uncertainties are 
listed in decreasing order of their impact on /t on the y- axis. The hatched blue and red boxes show 
the variations of jl with respect to the total error on /i, ert 0 t> referring to the top x-axis, when fixing 
the corresponding individual nuisance parameter 9 to its post-fit value 9 modified upwards or down¬ 
wards by its post-fit uncertainty, and repeating the fit. The filled circles, referring to the bottom 
a;-axis, show the pulls of the fitted nuisance parameters, i.e. the deviations of the fitted parameters 
9 from their nominal values 9q, normalised to their nominal uncertainties A 9. The black lines 
show the post-fit uncertainties of the nuisance parameters, relative to their nominal uncertainties, 
which are indicated by the yellow band. The jet energy scale uncertainties are decomposed into 
uncorrelated components. 
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Figure 9. Distributions of the BDT discriminants for the data taken at \fs = 8 TeV in the signal 
regions of the VBF (left) and boosted (right) categories for the Ti e pT lep (top), ri e pT had (middle), 
and ThadThad (bottom) channels. The Higgs boson signal (w# = 125 GeV) is shown stacked with 
a signal strength of fi = 1 (dashed line) and /i = 1.4 (solid line). The background predictions 
are determined in the global fit (that gives /i = 1.4). The size of the statistical and systematic 
normalisation uncertainties is indicated by the hashed band. The ratios of the data to the model 
(background plus Higgs boson contributions with fi = 1.4) are shown in the lower panels. The 
dashed red and the solid black lines represent the changes in the model when /r = 1.0or/r = 0 are 
assumed respectively. 
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Table 9. The predicted post-fit event yields in the Ti ep Tiep channel for ma = 125 GeV for the total number of events and for the two highest bins 
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9 Results 


As explained in the previous section, the observed signal strength is determined from a 
global maximum likelihood fit to the BDT output distributions in data, with nuisance 
parameters that are either free or constrained. The results are extracted for each channel 
and for each category individually as well as for combinations of categories and for the 
overall combination. 

At the value of the Higgs boson mass obtained from the combination of the ATLAS 
H H► 77 and H —> ZZ* measurements [116], mu — 125.36 GeV, the signal strength obtained 
from the combined H — > rr analysis is: 

/x = 1.43 ±°-£(stat.) lo^syst.) ± 0.09(theory syst.). 

The systematic uncertainties are split into two groups: systematic uncertainties (syst.) 
including all experimental effects as well as theoretical uncertainties on the signal region 
acceptance, such as those due to the QCD scales, the PDF choice, and the underlying 
event and parton shower; and, separately, theoretical uncertainties on the inclusive Higgs 
boson production cross section and H —> rr branching ratio (theory syst.). The results for 
each individual channel and for each category as well as for their combination are shown in 
figure 10. They are based on the full dataset, however, separate combined results are given 
for the two centre-of-mass energies. 

The probability po of obtaining a result at least as signal-like as observed in the data 

if no signal were present is calculated using the test statistic q /l= o = — 2 ln(£( 0 , 6)/C(jl, 6)) 
in the asymptotic approximation [117]. For mu = 125.36 GeV, the observed po value is 
2.7 x 10~ 6 , which corresponds to a deviation from the background-only hypothesis of 4.5a. 
This can be compared to an expected significance of 3.4<r. This provides evidence at the 
level of 4.5<r for the decay of the Higgs boson into tau leptons. Table 12 shows the expected 
and observed significances for the signal strength measured in each channel separately. 

Figure 11 shows the expected and observed number of events, in bins of log 10 (S/B), 
for all signal region bins. Here, S/B is the signal-to-background ratio calculated assuming 
/x = 1.4 for each BDT bin in the signal regions. The expected signal yield for both p = 1 
and the best-fit value p = 1.4 for mu = 125 GeV is shown on top of the background 
prediction from the best-fit values. The background expectation where the signal-strength 
parameter is fixed to /x = 0 is also shown for comparison. 

To visualise the compatibility of this excess of events above background predictions 
with the SM Higgs boson at mu = 125 GeV, a weighted distribution of events as a function 
of m^. MC is shown in figure 12. The events are weighted by a factor of ln(l + S/B), 
which enhances the events compatible with the signal hypothesis. The excess of events in 
this mass distribution is consistent with the expectation for a Standard Model Higgs boson 
with mu = 125 GeV. The distributions for the predicted excess in data over the background 
are also shown for alternative SM Higgs boson mass hypotheses of mu = 110 GeV and 
m.u = 150 GeV. The data favour a Higgs boson mass of mu = 125 GeV and are less 
consistent with the other masses considered. 
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Channel and Category 

Expected Significance (a) 

Observed Significance (a) 

Rep Rep VBF 

1.15 

1.88 

TiepTlep Boosted 

0.57 

1.72 

^lep^lep Total 

1.25 

2.40 

TlepThad VBF 

2.11 

2.23 

TicpThad Boosted 

1.11 

1.01 

AepRiad Total 

2.33 

2.33 

ThadThad VBF 

1.70 

2.23 

T’had'Hiad Boosted 

0.82 

2.56 

AiadAiad Total 

1.99 

3.25 

Combined 

3.43 

4.54 


Table 12. The expected and observed significances of the signal in each channel and category for 
the combined 7 and 8 TeV datasets. 


Source of Uncertainty 

Uncertainty on /j 

Signal region statistics (data) 

+0.27 

—0.26 

Jet energy scale 

± 0.13 

Tau energy scale 

± 0.07 

Tau identification 

± 0.06 

Background normalisation 

± 0.12 

Background estimate stat. 

± 0.10 

BR (H -+ tt) 

± 0.08 

Parton shower/Underlying event 

± 0.04 

PDF 

± 0.03 

Total sys. 

+0.33 

-0.26 

Total 

+0.43 

-0.37 


Table 13. Important sources of uncertainty on the measured signal-strength parameter /u. The 
contributions are given as absolute uncertainties on the best-fit value of [i = 1.43. Various sub¬ 
components are combined assuming no correlations. 


As discussed in section 8 , the dominant uncertainties on the measurement of the signal- 
strength parameters include statistical uncertainties on the data from the signal regions, 
uncertainties on the jet and tau energy scales, uncertainties on the normalisation of the 
Z —> tt and tt background components as well as theoretical uncertainties. The contribu¬ 
tions of each of these significant sources to the uncertainty of the measured signal strength 
are summarised in table 13. 

The normalisation uncertainties on the Z -+ tt embedded sample are correlated across 
the categories in each respective channel. The global fit also constrains the normalisation 
for Z —y tt more strongly than for the Z —> It and top-quark background components, as 
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Figure 10. The best-fit value for the signal strength /j, in the individual channels and their 
combination for the full ATLAS datasets at y/s = 7 TeV and yfs = 8 TeV. The total ±lcr uncer¬ 
tainty is indicated by the shaded green band, with the individual contributions from the statistical 
uncertainty (top, black), the experimental systematic uncertainty (middle, blue), and the theory 
uncertainty (bottom, red) on the signal cross section (from QCD scale, PDF, and branching ratios) 
shown by the error bars and printed in the central column. 


the low BDT-score region is dominated by Z —> rr events. 

The measurement of the overall signal strength discussed above does not give direct 
information on the relative contributions of the different production mechanisms. Therefore, 
the signal strengths of different production processes contributing to the H —> rr decay 
mode are determined, exploiting the sensitivity offered by the use of the event categories 
in the analyses of the three channels. The data are fitted separating the vector-boson- 
mediated VBF and VH processes from gluon-mediated ggF processes. Two signal strength 
parameters, and /^vbf+vh) which scale the SM-predicted rates to those observed, are 
introduced. The two-dimensional 68 % and 95% confidence level (CL) contours in the plane 
of 1-t^gF and /ivB F+ra [84] are shown in figure 13 for mu = 125.36 GeV. The best-fit values 
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Figure 11. Event yields as a function of log 10 (5'/13), where S (signal yield) and B (background 
yield) are taken from the BDT output bin of each event, assuming a signal strength /i = 1.4. 
Events in all categories are included. The predicted background is obtained from the global fit 
(with /x = 1.4), and signal yields are shown for m# = 125 GeV at /x = 1 and /x = 1.4 (the best-fit 
value). The background-only distribution (dashed line) is obtained from the global fit, with /x fixed 
at zero. 

are 

^IgF = 2-0 ± 0.8(stat.) g(syst.) ± 0.3(theory syst.) 

and 

^VBF +VH = !-24 la 45 (stat.) l^(syst.) ± 0.08(theory syst.), 

in agreement with the predictions from the Standard Model. The two results are strongly 
anti-correlated (correlation coefficient of —48%). The observed (expected) significances of 
the tfjgF and r BF+VH signal strengths are 1.74<r (0.95a) and 2.25a (1.72a) respectively. 

A total cross section times branching ratio for H —>• rr with run — 125 GeV can 
also be measured. The central value is obtained from the product of the measured n 
and the predicted cross section used to define it. The uncertainties are similarly obtained 
by scaling the uncertainties on fj, by the predicted cross section, noting that theoretical 
uncertainties on the inclusive cross section cancel between /x and the predicted cross section 
and thus are not included for the production processes under consideration. These include 
the uncertainties on the inclusive cross section due to the QCD scale and the PDF choice as 
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Figure 12. Distributions of the reconstructed invariant tt mass, where events are 

weighted by ln(l + S/B) for all channels. These weights are determined by the signal ( S ) and 
background ( B ) predictions for each BDT bin. The bottom panel in each plot shows the difference 
between weighted data events and weighted background events (black points), compared to the 
weighted signal yields. The background predictions are obtained from the global fit with the 
run = 125 GeV signal hypothesis (signal strength = 1.4). The run = 125 GeV signal is plotted 
with a solid red line, and, for comparison, signals for mu = HO GeV (blue) and run = 150 GeV 
(green) are also shown. The signal normalisations are taken from fits to data with the corresponding 
signal mass hypotheses, and the fitted /i values are given in the figure. The signal strengths are 
shown for the Standard Model expectations (/i = 1) in (a), while in (b) the best-fit values are used. 

well as the uncertainty on the branching ratio H — » tt; however, theoretical uncertainties 
on the acceptance of the signal regions from the QCD scale and PDF choice are retained, 
along with the uncertainties due to underlying event and parton shower, and the electroweak 
correction on VBF production. Table 14 gives the measured values for the total cross section 
at 7 and at 8 TeV, as well as the measured values at 8 TeV for gluon fusion production and 
for VBF and VH production separately. 
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Figure 13. Likelihood contours for the combination of all channels in the {l^ggp > Mvbf +vh) plane. 
The signal strength y is the ratio of the measured signal yield to the Standard Model expectation, 
for each production mode. The 68% and 95% CL contours are shown as dashed and solid lines 
respectively, for mp = 125.36 GeV. The SM expectation is shown by a filled plus symbol, and the 
best fit to the data is shown as a star. 



Measured axBR [pb] 

Predicted ctxBR [pb] 

7 TeV 

1.0 ±gJ(stat.)lo;g(syst.) 

1.09 ± 0.11 

8 TeV 

2.1 ± 0.4(stat.)^Q4(syst.) 

1.39 ± 0.14 

Gluon fusion, 8 TeV 

1.7 ± l.l(stat.)^}'4(syst.) 

1.22 ± 0.14 

VBF +VH, 8 TeV 

0.26 ± 0.09(stat.)+o;o5(syst.) 

0.17 ± 0.01 


Table 14. Measured and predicted total cross section times branching ratio for H — ► rr with 
run = 125 GeV, at 7 TeV and at 8 TeV for all production modes, as well as for gluon fusion and 
for VBF and VH production separately, at 8 TeV only. The theoretical predictions are obtained as 
described in section 3. 
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10 Cross-check with cut-based analysis 


The search for the SM Higgs boson presented above is cross-checked for the dataset collected 
at y/s = 8 TeV in an analysis where cuts on kinematic variables are applied. This search 
uses improved definitions of event categories and an improved fit model with respect to 
results previously published for the y/s — 7 TeV dataset [21]. To allow a straightforward 
comparison of results, the multivariate and cut-based analyses have common components. 
The two analyses are performed for the same three channels, Ti ep Ti e p, TiepThad and Thad r had) 
they use the same preselection and share the same strategy for the estimation of background 
contributions and systematic uncertainties. As in the multivariate analysis, the irreducible 
Z —> tt background is estimated using the embedding procedure and the reducible ones are 
estimated using similar data-driven methods, as described in section 6. Finally, the same 
statistical methods are used to extract the results, although these are applied to different 
discriminating variables. While the multivariate analysis performs a fit to the BDT output 
distribution, the cut-based analysis relies on a fit to the tt invariant mass distribution. The 
tt invariant mass is calculated using the missing mass calculator, as described in section 4.3. 
The analysis is not designed to be sensitive to a specific value of the Higgs boson mass rrifj . 
The use of the mass as the discriminating variable is motivated not only by its power to 
separate the irreducible Z tt background from signal, but also by its sensitivity to the 
mass of the signal itself. 

In the cut-based analysis, a categorisation is performed similar to that in the multivari¬ 
ate analysis, i.e. VBF and boosted categories are defined. To increase the separation power, 
subcategories are introduced for the Ti ep Th a d and Th a dTh a d channels. These subcategories 
target events produced via the same production mode, but select different phase-space 
regions with different signal-to-background ratios. With this strategy the most sensitive 
subcategories have a small number of events, but a high signal-to-background ratio. Al¬ 
though the combined sensitivity is dominated by the few highly sensitive subcategories, the 
others are important not just to increase the sensitivity but also to constrain the various 
background components. 

An overview of the defined categories in the three channels is given in table 15. In 
all channels, the event categorisation is designed by splitting events first according to the 
production mode, either VBF-like or boosted ggF-like, and second, for the Ti ep Th a d and 
t h a d7"had channels, by signal-to-background ratio. The events accepted in the VBF categories 
pass a common selection that requires the presence of the two forward jets distinctive of 
VBF production. In the Ti ep Th a d channel, tight and loose VBF subcategories are defined, 
via cuts on the mass of the dijet system, rrijj, and p^, the transverse momentum of the Higgs 
boson candidate. In the Thadrhad channel, the variables used to select the most sensitive 
categories for both production modes are pt£ and the separation AR(ti,to) between the 
two Th a d candidates. In the VBF-like events, correlations between the invariant mass of 
the selected jets rrijj and A rjjj of the jets characteristic of VBF production are also used. 
The subcategory with the highest purity is the VBF high-pi P subcategory, where tight cuts 
on p and AR^ti^t/) reject almost all non-resonant background sources. The other two 
VBF-like subcategories are distinguished by a different signal-to-background ratio due to a 
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tighter selection applied to the forward jets. For the Thad T had channel, boosted subcategories 
are also defined. The division is based on the same cuts on p^ and , 72 ) as used 

in the VBF high-p^ category. Events with low transverse momentum are not used in 
any category because in such events the signal cannot be effectively distinguished from 
background channels. The proportion of the signal yield produced via VBF in the VBF- 
like subcategories is found to be 80% in the Ti ep Ti ep channel, between 67% and 85% in the 
TiepThad channel and between 58% and 78% in the ThadThad channel. 

The final results are derived from the combined fit of the m TT distributions observed 
in the various subcategories. The number of fitted signal and background events in each 
channel and category is given in table 16. The combined mass distribution for the three 
channels is shown in figure 14, where events are weighted by ln(l + S/B ), based on the 
signal and background content of their channel and category. An excess of events above 
the expected SM background is visible in the mass region around 125 GeV. 




Figure 14. Distribution of the reconstructed invariant rr mass, m^. MC , for the sum of all channels 
in the cut-based analysis for the data taken at yTs = 8 TeV. The events are weighted by a factor 
of ln(l + S/B) based on the signal ( S ) and background ( B ) yields in each category. The bottom 
panel shows the difference between weighted data events and weighted background events (black 
points), compared to the weighted signal yields. The background predictions are obtained from the 
global fit with the to# = 125 GeV signal hypothesis (signal strength p = 1.4). The to# = 125 GeV 
signal is plotted as a solid red line, and, for comparison, signals for to# = 110 GeV (blue) and 
to# = 150 GeV (green) are also shown. The signal normalisations are taken from fits to data with 
the corresponding signal mass hypotheses and the fitted p values are given in the figure. 

The signal strengths extracted in the three analysis channels and their combination are 
given in table 17. This cut-based analysis also provides evidence for H — > tt decays, giving 
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a combined signal strength of 


/^Cut—based = 1-43 (stat.) l^^syst.) ± 0.10(theory syst.) 

for run — 125.36 GeV. For comparison, the results obtained in the multivariate analysis for 
the dataset at y/s = 8 TeV are also included in table 17. Good agreement between the results 
of the two analyses is found for the individual channels as well as for their combination. 
To further quantify the level of agreement, the correlation p and the uncertainties on the 
differences between the p values obtained, i.e. A/i ± 5(Ap), were evaluated using the so- 
called jackknife technique [118, 119]. Using this method, the correlation between the p 
values obtained in the two analyses is found to be between 0.55 and 0.75 for each of the 
three analysis channels. The results of the analyses are found to be fully compatible, with 
deviations Ap/5(Ap) below 1 for all analysis channels as well as for the combined result. 

The probability po of obtaining a result at least as signal-like as observed if no signal 
were present is shown as a function of the mass in figure 15 for the cut-based analysis for 
the combined dataset at y/s — 8 TeV. The observed po values show a shallow minimum 
around 125 GeV, corresponding to a significance of 3.2<r. The expected significance for 
the cut-based analysis is superimposed on the figure and reaches a significance of 2.5<r at 
mg = 125.36 GeV. The corresponding significance values for the multivariate analysis for 
the dataset at y/s — 8 TeV are found to be 4.5(7 (observed) and 3.3<r (expected). They are 
also indicated in the figure. 

Given the mass sensitivity of the cut-based analysis, a two-dimensional likelihood fit 
for the signal strength p and the mass run is performed. The mass points are tested in 
steps of 5 GeV in the range between 100 GeV and 150 GeV. The best fit value is found 
at p = 1.4 and mu = 125 GeV. The result is shown in the (m#, p) plane in figure 16 
together with the 68% and 95% CL contours. This result indicates that the observation is 
compatible with the decay of a Standard Model Higgs boson with a mass of 125 GeV. 
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m H [GeV] 


Figure 15. Observed (solid red) and expected (dashed red) po values as a function of mn for the 
combination of all channels in the cut-based analysis (CBA) for the data taken at yfs = 8 TeV. The 
expected p 0 values are given for the background-only hypothesis. The corresponding observed and 
expected po values for the multivariate analysis (MVA) are indicated for run = 125 GeV by a full 
and open star respectively. The axis labels on the right hand side and the dotted lines display the 
significance in units of Gaussian standard deviations. 
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Channel 

VBF category selection criteria 

Tlep'Hep 

At least two jets with p^ > 40 GeV and p^? > 30 GeV 
> 3 -° 

TOj, j 2 > 400 GeV 

6-jet veto for jets with p T > 25 GeV 

Jet veto: no additional jet with px > 25 GeV within |;y| < 2.4 

^lep^had 

At least two jets with pi^ > 40 GeV and p^ > 30 GeV 

E^ iss > 20 GeV 

\Ar]jij 2 \ > 3.0 and 77 ( 71 ) • 77 (j 2 ) < 0, m 3 ij 2 > 300 GeV 

pTotal = |p€, +p^ad + 0h + pjjmiss | < 39 Ge y 

6-jet veto for jets with px > 30 GeV 
min (%'i)>%2)) < »?(/)> »7 (tw) < max(r7 (jl) ,77 (j2 )) 

VBF tight 

m juh > 500 GeV 
p% > 100 GeV 

PT had > 30 GeV 
m v j s > 40 GeV 

VBF loose 

Non tight VBF 


^had^had 

At least two jets with p ^ > 50 GeV and p > 30 GeV 
|Ary(ri,T 2 )| < 1.5 

| Ar/ 3l 72 1 > 2.6 and rap j 2 > 250 GeV 
min ( ? ?(ii)>% 2 )) < V( Ti ),V(t 2 ) < ma AV(h),V(j 2 )) 

VBF high pH 

A1?(ti,t 2 ) <1.5 and 
p% > 140 GeV 

VBF low pH , tight 

Ai?(n, r 2 ) > 1.5 or 
p^ < 140 GeV 
rrij i 2 [GeV] > (-250 • 

Apj 1 ,j 2 + 1550) 

VBF low pH , loose 

Ai?(n, r 2 ) > 1.5 or 
pf < 140 GeV 
m h ia [GeV] < (-250 • 

|A/ 7jlj2 | + 1550) 

Channel 

Boosted category selection criteria 

Tlep'Hep 

Exclude events passing the VBF selection 
p% > 100 GeV 

6-jet veto for jets with px > 25 GeV 

^lepThad 

Exclude events passing the VBF selection 
£“ iss > 20 GeV 
p% > 100 GeV 
pr(Thad) > 30 GeV 

6-jet veto for jets with px > 30 GeV 

^had^had 

Exclude events passing the VBF selection 

Ap(ri ,r 2 ) < 1.5 
p^ > 100 GeV 

Boosted high pH 

AJ?(ti,t 2 ) <1.5 and 
p% > 140 GeV 

Boosted low pH 

AR(ti,t 2 ) > 1.5 or 
pf < 140 GeV 


Table 15. Summary of the selection criteria used to define the VBF and boosted subcategories in 
the cut-based analysis for the three analysis channels. The labels (1) and (2) refer to the leading 
(highest pt) and subleading final-state objects (leptons, Th a d) jets). The variables are defined in 
the text. 
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Tep+lep 

VBF 

Boosted 

Total signal 


11 ±4 


38 ± 13 

Total background 

130 ±7 


3400 ± 64 

Data 


152 


3428 

Tep+had 

Tight VBF 

Loose VBF 

Boosted 

Signal 

8.8 ±3 

17 ±6 

52 ± 17 

Background 

52 ±4 

398 ± 17 

4399 ± 73 

Data 

62 

407 

4435 

Thad^had 

VBF high /+ 

VBF low pH 
tight loose 

Boosted 

high pH low pH 

Signal 

5.7 ± 1.9 

5.2 ± 1.9 

3.7 ± 1.3 

17 ± 6 

20 ± 7 

Background 

59 ±4 

86 ±5 

156 ± 7 

1155 ±28 

2130 ±41 

Data 

65 

94 

157 

1204 

2121 


Table 16. The measured signal and background yields of the cut-based analysis at >Js = 8 TeV in 
the TiepTiep, Ti ep ri ia( j and Thad' r had channels for m# = 125 GeV. The normalisations and uncertainties 
are taken from the global fit. The uncertainties on the predicted yields reflect the full statistical 
and systematic uncertainties. 



Fitted n values 



Multivariate 

analysis 

Cut-based 

analysis 

'Hep'Hep 

8 TeV 

1 Q+ 1 - 0 

0.9 

q 0 + 1.4 

'Hep+had 

8 TeV 

1 1+0-6 
1 " l -0.5 

o 

1 + 
o o 

05 + 

AiadAiad 

8 TeV 

1 O+0.9 

1 (1+0-9 

All channels 

8 TeV 

1 cq+0.47 
1-Oo_ 0 4i 

1 43+0-55 


Table 17. Fitted values of the signal strength for the different channels at yT; = 8 TeV for the 
multivariate and cut-based analyses, measured at m#=125.36 GeV. The results for the combinations 
of all channels are also given. The total uncertainties (statistical and systematic) are quoted. 
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Figure 16. The results of the two-dimensional likelihood fit in the (to#, fi) plane for the cut-based 
analysis for the data taken at yfs = 8 TeV. The signal strength /r is the ratio of the measured signal 
yield to the Standard Model expectation. The 68% and 95% CL contours are shown as dashed and 
solid red lines respectively. The best-fit value is indicated as a red cross. The dashed and solid blue 
lines correspond to the expected 68% and 95% CL contours for to# = 125.36 GeV and /i = 1.43. 
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11 Conclusions 


Evidence for decays of the recently discovered Higgs boson into pairs of tau leptons is pre¬ 
sented. The analysis is based on the full set of proton-proton collision data recorded by the 
ATLAS experiment at the LHC during Run 1. The data correspond to integrated luminosi¬ 
ties of 4.5 fb _1 and 20.3 fb” 1 at centre-of-mass energies of y/~s = 7 TeV and y / s = 8 TeV 
respectively. All combinations of leptonic and hadronic tau decay channels are included and 
event categories selecting both the vector boson fusion and highly boosted rr signatures 
are considered in a multivariate analysis. An excess of events over the expected background 
from other Standard Model processes is found with an observed (expected) significance of 
4.5 (3.4) standard deviations. This excess is consistent with resulting from H —> tt decays 
with mn = 125.36 GeV. The measured signal strength, normalised to the Standard Model 
expectation, is 


fj, = 1.43 t°-£ ( stat.) ±o'il( s yst.) ± 0.09(theory syst.). 

This value is consistent with the predicted Yukawa coupling strength of the Higgs boson in 
the Standard Model. 

The results of the multivariate analysis are cross-checked for the data collected at 
y/s = 8 TeV using a cut-based analysis. The results confirm the findings of the multivariate 
analysis, and an excess with a compatible signal strength is found. A two-dimensional fit of 
the signal strength /i and m# in the cut-based analysis indicates that the observed excess 
is compatible with the rr decay of the Higgs boson with a mass of 125 GeV. 
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